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ABSTRACT 

The introduction of low-frequency radio arrays is expected to revolutionize the 
study of the reionization epoch. Observation of the contrast in redshifted 21cm emis- 
sion between a large HII region and the surrounding neutral inter-galactic medium 
(IGM) will be the simplest and most easily interpreted signature. However the highest 
redshift quasars known are thought to reside in an ionized IGM. Using a semi-analytic 
model we describe the redshifted 21cm signal from the IGM surrounding quasars dis- 
covered using the i-drop out technique (i.e. quasars at z ~ 6). We argue that while 
quasars at z < 6.5 seem to reside in the post overlap IGM, they will still provide valu- 
able probes of the late stages of the overlap era because the light-travel time across 
a quasar proximity zone should be comparable to the duration of overlap. For red- 
shifted 21cm observations within a 32MHz bandpass, we find that the subtraction of 
a spectrally smooth foreground will not remove spectral features due to the proximity 
zone. These features could be used to measure the neutral hydrogen content of the 
IGM during the late stages of reionization. The density of quasars at z ~ 6 is now 
well constrained. We use the measured quasar luminosity function to estimate the 
prospects for discovery of high redshift quasars in fields that will be observed by the 
Murchison Widefield Array. 

Key words: cosmology: diffuse radiation, large scale structure, theory - galaxies: 
high redshift, inter-galactic medium 



1 INTRODUCTION 

The reionization of cosmic hydrogen by the first stars and 
quasars (e.g. Barkana & Loeb 2001), was an important mile- 
stone in the history of the Universe. The recent discovery of 
distant quasars has allowed detailed absorption studies of 
the state of the high redshift intergalactic medium (IGM) 
at a time when the universe was less than a billion years old 
(Fan et al. 2006; White et al. 2003). Several studies have 
used the evolution of the ionizing background inferred from 
these spectra to argue that the reionization of cosmic hy- 
drogen was completed just beyond z ~ 6 (Fan et al. 2006; 
Gnedin & Fan 2006; White et al. 2003). However, other au- 
thors have claimed that the evidence for this rapid change 
becomes significantly weaker for a different choice of density 
distribution in the IGM (Becker et al. 2007). Different ar- 
guments in favour of a rapidly evolving IGM at z > 6 are 
based on the properties of the putative HII regions inferred 
around the highest redshift quasars (Wyithe & Loeb 2004; 
Wyithe, Loeb, & Carilli 2005; Mesinger & Haiman 2005). 
However, Bolton & Haehnelt (2007a) and Lidz et al. (2007) 



have demonstrated that the interpretation of the spectral 
features is uncertain and that the observed spectra could 
either be produced by an HII region, or by a classical prox- 
imity zone. One reason for the ambiguity in interpreting 
these absorption spectra is that Lya absorption can only be 
used to probe neutral fractions that are smaller than 10 -3 
owing to the large cross-section of the Lya resonance. Thus 
studies of the IGM in Lya absorption become inconclusive 
in the era of interest for reionization. 

On the other hand there is mounting evidence that the 
reionization of the IGM was photon starved. Firstly Bolton 
& Haehnelt (2007b) have shown that the observed ionization 
rate at z < 6 implies an emissivity that is only just suffi- 
cient to have reionized the universe by that time. Similarly, 
the small escape fractions found for high redshift galaxies 
by several studies (Chen et al. 2007; Gnedin et al. 2007; 
Srbinovsky & Wyithe 2008) together with the star forma- 
tion rates implied by the observed high redshift galaxy pop- 
ulation suggest a photon budget that struggles to have been 
sufficient to reionize the universe by z ~ 6 (Gnedin 2007; 
Srbinovsky & Wyithe 2008) . These results imply that while 
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the IGM seems to be highly ionized along the lines-of-sight 
towards the highest redshift quasars discovered in current 
surveys, the reionization epoch cannot be at a substantially 
higher redshift unless the emissivity grows significantly at 
z > 6. Indeed this outcome may be suggested by the large 
optical depth to electron scattering measured by the WMAP 
satellite (Spergel et al. 2007). 

A better probe of the process of reionization will be pro- 
vided by redshifted 21cm observations. Reionization starts 
with ionized (HII) regions around galaxies, which later grow 
to surround groups of galaxies. The process of reionization is 
completed when these HII regions overlap (defining the so- 
called overlap epoch) and fill-up most of the volume between 
galaxies. Several probes of the reionization epoch in red- 
shifted 21cm emission have been suggested in a large body 
of literature. These include observation of 21cm emission as 
a function of redshift averaged over a large area of the sky. 
This provides a direct probe of the evolution in the neutral 
fraction of the IGM, the so-called global step (Shaver, Wind- 
horst, Madau & de Bruyn 1999; Gnedin & Shaver 2004). A 
more powerful probe will be provided by observation of the 
power-spectrum of fluctuations together with its evolution 
with redshift. This observation would trace the evolution of 
neutral gas with redshift as well as the topology of the reion- 
ization process (e.g. Tozzi, Madau, Meiksin & Rees 2000; 
Furlanetto, Hernquist & Zaldarriaga 2004; Loeb & Zaldar- 
riaga 2004; Barkana & Loeb 2005a, b,c). It is thought that 
the amplitude of 21cm fluctuations will be greatest when the 
neutral fraction in the IGM is around 50% (Furlanetto et 
al. 2004; Lidz et al. 2007). Thus, while the power-spectrum 
should prove to be the best technique for study of the bulk 
of the reionization epoch, it may not be a sensitive probe of 
the very late stages of the overlap era. 

Finally, observations of individual quasar HII regions 
will probe quasar physics as well as the evolution of the 
neutral gas in the surrounding IGM (Wyithe & Loeb 2004b; 
Kohler, Gnedin, Miralda-Escude & Shaver 2005). Kohler et 
al. (2005) have generated synthetic spectra using cosmolog- 
ical simulations. They conclude that quasar HII regions will 
provide the most prominent individual cosmological signals. 
These individual signatures will be most readily detected 
a-posteriori, around known high redshift quasars (Wyithe, 
Loeb & Barnes 2005; Geil & Wyithe 2007). These studies 
have focused on the scenario of a quasar expanding into a 
significantly neutral IGM. However the density of quasars 
is very low at high redshift, while as discussed above, the 
IGM allows substantial Lya transmission and so is thought 
to be highly ionized along the lines-of-sight to nearly all of 
the known high redshift quasars. 

The conventional wisdom has been that the 21cm signal 
disappears after the overlap epoch is complete, because there 
is little neutral hydrogen left through most of intergalactic 
space. However observations of damped Lya systems out to 
a redshift of z ~ 4 show the cosmological density parameter 
of HI to be fl H i ~ 10~ 3 (Prochaska et al. 2005). In the stan- 
dard cosmological model the density parameter of baryons 
is fib ~ 0.04, so that the mass-averaged neutral hydrogen 
fraction at z ~ 4 (long after the end of the HII overlap 
epoch) is F m ~ 0.03. This neutral gas does not contribute 
significantly to the effective Lya optical depth, which is sen- 
sitive to the volume averaged neutral fraction (with a value 
that is orders of magnitude lower). However the redshifted 



21cm emission is sensitive to the total (mass- weighted) opti- 
cal depth of this neutral gas. Observations of the redshifted 
21cm signal would therefore detect the total neutral hydro- 
gen content in a volume of IGM dictated by the observatory 
beam and frequency band-pass (Wyithe & Loeb 2007) . Since 
quasars could be observed through the entire overlap epoch, 
redshifted 21cm observations of the surrounding IGM could 
provide a bridge between 21cm fluctuations at high redshift 
and the well studied techniques utilizing the Lya forest fol- 
lowing the completion of reionization. 

We begin by describing our density dependent semi- 
analytic model for the reionization history (§H|. We next 
describe our calculation of the depletion of neutral hydrogen 
near the vicinity of a high redshift quasar ( § (3j . Then in § [4] 
and §[5]we describe the 21cm signal from the proximity zones 
and estimate of the effect of foreground removal. Finally we 
summarize existing observations of the high redshift quasar 
luminosity function (§|6}, and predict the number that will 
be found in future surveys (§ before presenting our con- 
clusions in § [8] Throughout the paper we adopt the set of 
cosmological parameters determined by WMAP (Spergel et 
al. 2007) for a flat ACDM universe, namely Q m = 0.24, 
Qa = 0.76 and h — 0.73. In computation of the mass func- 
tion we assume a primordial power spectrum defined by a 
power law with index n — 0.95, an exact transfer function 
given by Bardeen et al. (1986) and rms mass density fluctu- 
ations with a sphere of radius Rg = 8ft _1 Mpc of as = 0.76. 



2 SEMI-ANALYTIC MODEL FOR THE 
REIONIZATION HISTORY 

In this section we describe the semi-analytic model which 
we use to describe the ionization state of the IGM during 
the reionization history of the IGM. Our model is based on 
the work of Miralda-Escude et al. (2000) who presented a 
prescription which allows the calculation of an effective re- 
combination rate in an inhomogeneous universe by assum- 
ing a maximum over density (A c ) penetrated by ionizing 
photons within HII regions. The model assumes that reion- 
ization progresses rapidly through islands of lower density 
prior to the overlap of individual cosmological ionized re- 
gions. Following the overlap epoch, the remaining regions of 
high density are gradually ionized. It is therefore hypothe- 
sized that at any time, regions with gas below some critical 
over density Ai = pi/ (p) are highly ionized while regions 
of higher density are not. In what follows, we draw pri- 
marily from their prescription and refer the reader to the 
original paper for a detailed discussion of its motivations 
and assumptions. Wyithe & Loeb (2003) employed this pre- 
scription within a semi-analytic model of reionization. This 
model was extended by Srbinovsky & Wyithe (2007) and 
by Wyithe, Bolton & Haehnelt (2007). We summarise the 
model in the remainder of this section, but refer the reader 
to those papers for a full description. 

Within the model of Miralda-Escude et al. (2000) we 
describe the post-overlap evolution of the IGM by comput- 
ing the evolution of the fraction of mass in regions with over 
density below Ai, 

F M (A;) = / A 'dAP v (A)A, (1) 
Jo 
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Figure 1. The effect of over density on the redshift of overlap, and the subsequent ionization state of the IGM. Five cases are shown, 
corresponding to over densities evaluated within spheres with radii of 3, 5, 10 and 25 physical Mpc, centered on a quasar host of mass 
M = 10 13 Af o at z ~ 6. In each case we evaluate the reionization history assuming the mean over density surrounding the quasar. 
Upper Left Panel: The ionization rate as a function of redshift. The observational points are from Bolton et al. (2007b). Upper Right 
Panel: The volume (lower curves) and mass (upper curves) averaged fractions of neutral gas in the universe. Also shown (dotted lines) 
is the fraction of the IGM yet to overlap (1 — Qi). The observational points for the volume averaged neutral fraction are from Bolton et 
al. (2007b), while the observed mass-fractions are from the damped Lyc* measurements of Prochaska et al. (2005). Lower Left Panel: The 
mean-free-path for ionizing photons computed using the formalism in §[2] The data points are based on Storrie-Lombardi et al. (1994). 
Lower Right Panel: The evolution of the mean 21cm brightness temperature (in mK) with redshift (solid lines). For comparison, the 
fraction of IGM yet to overlap (1 — Qi) is over plotted. 



where Pv(A) is the volume weighted probability distribution 
for A. Miralda-Escude et al. (2000) quote a fitting function 
which provides a good fit to the volume weighted probability 
distribution for the baryon density in cosmological hydrody- 
namical simulations. This probability distribution remains 
a reasonable description at high redshift when confronted 
with a more modern cosmology and updated simulations, 
although the addition of an analytical approximation for the 
high density tail of the distribution remains necessary as a 
best guess at correcting for numerical resolution (Bolton & 
Haehnelt 2007b). 



In the post overlap era the model computes the evo- 
lution of Ai. In the pre-overlap era we define the quantity 
Qi to be the volume filling factor within which all matter 
at densities below Ai has been ionized. Within this formal- 
ism, the epoch of overlap is precisely defined as the time 
when Qi reaches unity. However, prior to overlap we have 
only a single equation to describe the evolution of two in- 
dependent quantities Qi and Fm (or equivalently Ai). The 
relative growth of these depends on the luminosity func- 
tion and spatial distribution of the sources. In our model 
we follow Miralda-Escude et al. (2000) and assume Ai to 



be constant (of value A c ) with redshift before the overlap 
epoch, and in this paper compute results for models that 
assume A c = 20. Our approach is to compute a reioniza- 
tion history given a particular value of A c , combined with 
assumed values for the efficiency of star-formation and the 
fraction of ionizing photons that escape from galaxies. With 
this history in place we then compute the evolution of the 
background radiation field due to these same sources. After 
the overlap epoch, ionizing photons will experience attenua- 
tion due to residual over dense pockets of HI gas. We use the 
description of Miralda-Escude et al. (2000) to estimate the 
ionizing photon mean-free-path, and subsequently derive the 

1 We note that the assumption of a fixed value of A c (and hence a 
slowly evolving mean-free-path) in the pre overlap era is artificial. 
Indeed, A c is probably only a meaningful quantity after overlap 
is complete and the mean-free-path is set by dense systems, while 
before overlap the mean-free-path is sensitive to the the size of 
HII regions (which increases with redshift). However, within the 
model of Miralda-Escude et al. (2000), the value of A c and the 
ionization fraction are both unknowns prior to overlap, with only 
one equation to govern their evolution. An assumption regarding 
A c is therefore unavoidable within the formalism used in this 
paper. 
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attenuation of ionizing photons. We then compute the flux 
at the Lyman-limit in the IGM due to sources immediate 
to each epoch, in addition to redshifted contributions from 
earlier epochs. 

We assume the spectral energy distribution (SED) of 
population-II star forming galaxies with a gas metalicity of 
0.05 and a Scalo IMF, using the model presented in Leitherer 
et al. (1999). The star formation rate per unit volume is 
computed based on the collapsed fraction obtained from the 
extended Press-Schechter (1974) model (Bond et al. 1991) 
in halos above the minimum halo mass for star formation, 
together with an assumed star formation efficiency (/*). In 
a cold neutral IGM beyond the redshift of reionization, the 
collapsed fraction should be computed for halos of sufficient 
mass to initiate star formation. The critical virial tempera- 
ture is set by the temperature (Tn ~ 10 4 K) above which 
efficient atomic hydrogen cooling promotes star formation. 
Following the reionization of a region, the Jeans mass in 
the heated IGM limits accretion to halos above T\ ~ 10 5 
K (Efstathiou 1992; Thoul & Weinberg 1996; Dijkstra et 
al. 2004). Only a fraction of ionizing photons produced by 
stars enter the IGM. Therefore an additional factor of / esc 
(the escape fraction) must be included when computing the 
emissivity of galaxies. In our fiducial model we assume this 
escape fraction to be independent of mass. We define a pa- 
rameter /* :CSC = /*/ eB o- 

Figure [1] shows models for the reionization of the IGM 
and the subsequent post-overlap evolution of the ionizing 
radiation field. The fiducial model (shown by the thick grey 
curves) has /*, csc = 0.00375. Our model allows the bias 
of reionization near a massive halo to be included explic- 
itly, and we show histories corresponding to regions within 
R = 3, 5, 10 and 25 proper Mpc surrounding a quasar host 
of mass M = 10 13 M Q . In the top left panel of Figure Q] we 
show the evolution of the ionization rate. The observational 
points are from the simulations of Bolton et al. (2007b; based 
on the observations of Fan et al. 2006). In the upper- right 
panels we plot the corresponding volume and mass (upper 
curves) averaged fractions of neutral gas in the universe. 
The observational points for the volume averaged neutral 
fraction are from Bolton et al. (2007b), while the observed 
mass-fractions are from the damped Lya measurements of 
Prochaska et al. (2005) , and therefore represent lower limits 
on the total HI content of the IGM. Both curves show excel- 
lent agreement with these quantities, despite their differing 
by 3 orders of magnitude. In computing the volume averaged 
neutral fraction we have followed standard practice and as- 
sumed ionization equilibrium with an ionizing background 
at all over densities. However in an IGM that includes dense 
regions that are self-shielded, this value under estimates the 
true value. We note that the inclusion of fully neutral gas at 
densities above the self shielding over density does not mod- 
ify the predicted value of effective Lya transmission, from 
which IGM properties are inferred. However neutral hydro- 
gen above the self-shielding threshold does contribute signif- 
icantly to the volume averaged neutral fraction (in addition 
of course to the mass averaged neutral fraction) interpreted 
from Lya absorption spectra. 

In the lower-left panel we plot the evolution of the ion- 
izing photon mean-free-path. The data points are based on 
Storrie-Lombardi et al. (1994). Again the model is in good 
agreement with the available observations. We note that the 



observed mean-free-path is found from the number density 
of Ly-limit systems and is independent of the Lya forest 
absorption derived quantities of ionization rate and volume 
averaged neutral fraction, as well as being independent of the 
HI mass-density measurements. Our simple model therefore 
simultaneously reproduces the evolution of three indepen- 
dent measured quantities. In the lower-right panel we plot 
the corresponding evolution of the 21cm brightness temper- 
ature (dark lines). The grey lines show the evolution of the 
filling factor of ionized regions (1 — Qi). 



3 QUASAR PROXIMITY ZONES DURING 
OVERLAP 

The enhanced ionization rate near quasars at moderate red- 
shift produces a region of thinned Lya forest that extends 
for several Mpc (e.g. Scott et al. 2000). This thinning of the 
Lya forest is termed the proximity effect and the region of 
enhanced ionization the proximity zone. Prior to the end of 
reionization the proximity zone is not defined since there is 
no ionizing background and instead the quasar contributes 
to an enlarged, distinct HII region. In this section we aim 
to model the effect of the quasar flux on the mass averaged 
neutral fraction within the proximity zones of quasars dur- 
ing the overlap era. 

Our semi-analytic model provides a framework within 
which to model the depletion of neutral hydrogen within 
the proximity zone. Since gas is neutral at over densities 
above Ai, we must estimate the change in A; induced by 
the quasar. We begin with the ionization rate from our semi- 
analytic model as a function of radius from the quasar host. 
We then add the ionization rate due to a quasar with lumi- 
nosity of 0.7 x 10 57 s _1 , which corresponds to a quasar with 
an absolute luminosity that is around a magnitude fainter 
than the most luminous SDSS z ~ 6 quasars. This process 
includes calculation of biased reionization within the vicin- 
ity of the quasar host, but does not include the increase 
in ionizing photon mean-free-path that would result from 
the presence of quasar flux. We therefore underestimate the 
ionization rate in the vicinity of the quasar. The most over 
dense regions of the IGM are self shielding to ionizing radia- 
tion. The over density of a clump at which gas becomes self 
shielding may be estimated from (Furlanetto & Oh 2005; 
Bolton & Haehnelt 2007b) 

A S sc = 50iv(i±iy 3 r^, (2) 

where we have neglected the mild dependence on temper- 
ature. This expression assumes that the typical size of an 
absorber with over density A is the local Jeans length, and 
that the absorber becomes optically thick to Lyman limit 
photons when the column density iVm exceeds Na^ , where 
am is the hydrogen photo-ionization cross-section at the Ly- 
man limit. The coefficient N has been previously assumed 
to equal unity, but is somewhat arbitrary, and we discuss its 
value below. 

Our model for the reionization of the IGM surrounding 
a quasar is not internally consistent, which would require a 
full numerical simulation. On the other hand, such a simu- 
lation is currently beyond the available numerical resolution 
over volumes sufficiently large to host a high redshift quasar 
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Figure 2. Impact of the quasar flux on local IGM properties. Left: Lyce transmission as a function of distance from the quasar in units 
of proper Mpc (dark curves). The shaded region shows the limit on transmission in the Gunn- Peterson troughs of the deepest available 
spectra. Right: Mass weighted neutral fraction as a function of distance from the quasar (dark curves) . In each panel examples are shown 
corresponding to 4 different quasar redshifts. Also shown are the corresponding examples for the mean IGM, centered on the assumed 
quasar redshifts (grey curves). 



proximity zone, while the damped Lycv systems thought to 
dominate the high redshift neutral gas are currently sub- 
ject to many model uncertainties (Nagamine et al. 2007). 
The inconsistency within our model can be traced partly 
to the fact that Ai is computed in a non-equilibrium con- 
dition as reionization progresses, while Assc is computed 
in ionization equilibrium with an estimate of the ionizing 
flux. However most importantly, one has to assume a density 
profile in order to calculate a column density. The density 
profiles which are assumed in the distribution Pv(A) differ 
from the top-hat density profile assumed in calculation of 
Assc (Furlanetto & Oh 2005). In order to make the model 
internally consistent we therefore choose the value of N in 
equation ([2| at each redshift such that Assc = Ai in the 
mean ionizing background (i.e. far from the quasar). The 
mass averaged neutral fraction within the proximity zone at 
a radius 7? is then obtained from 

F M (R)= [ ASSC dAAPv(A)x m , (3) 
Jo 

where xui is the neutral fraction of hydrogen, which is 
evaluated assuming ionization equilibrium at over densities 
A < Assc, and is equal to unity when A > Assc- 

An important ingredient in our modeling is the compu- 
tation of the local ionization rate at the retarded time along 
the line of sight. The use of retarded time is particularly 
important for quasars observed near the end of reionization 
since the ionization state of the IGM changes dramatically 
during the light travel time of a quasar proximity zone. In 
regions of the IGM observed in front of the quasar (i.e. at 
lower redshift) we assume ionization equilibrium of the hy- 
drogen with the sum of quasar and ionizing back ground 
flux at all radii. This is because the IGM is observed in pho- 
tons that arrive at the observer at the same time as photons 
emitted by the quasar (either when observed in absorption 
against the quasar, or in 21cm emission). However the 21cm 
emission from IGM behind the quasar can only be subject 

2 The modification of the value N in this process could be 
thought of as providing a correction that accounts for the effect 
of the density profile on the column density. 



to ionization by quasar flux if it is located within a distance 
R — ct q /2 of the quasar (where t q is the quasar lifetime). 

The resulting model proximity zones, described by the 
mass averaged neutral fraction as a function of distance from 
the quasar (dark curves) are plotted in the right-hand panel 
of Figure [2] Since the neutral gas in confined to discrete 
clumps following reionization, an individual line of sight 
through a proximity zone will not have a smooth profile in 
neutral hydrogen density. However our semi-analytic model 
is unable to compute individual realizations of the IGM, but 
rather computes the average behaviour which is represented 
by a smooth profile. Thus, examples of these smooth pro- 
files are shown corresponding to 4 different quasar redshifts 
which cover the end of the overlap epoch (as predicted within 
our model). The curves show a proximity zone extending to 
around 10 proper Mpc. The curves also show an asymmetry 
in the proximity zone, which has a greater contrast behind 
the quasar. This asymmetry is largest earlier in the overlap 
era. Also shown for comparison are the corresponding ex- 
amples for the mean IGM, centered on the assumed quasar 
redshifts (grey curves). 

We also compute the Lya transmission as a function of 
distance from the quasar, and plot the results in the left- 
hand panel of Figure [5] These transmission curves may be 
compared with observations of high redshift quasars. The 
deepest spectra of high redshift quasars reach a limit of T ~ 
0.002 towards quasars at z ~ 6.3 — 6.4. This limit is shaded 
grey. Our model predicts that the Gunn-Peterson Trough 
will appear in the spectra of quasars at z ~ 6.4 (in agreement 
with observation), even though the IGM is highly ionized. 
The model Gunn-Peterson trough begins at a distance of 
approximately 7-8Mpc from the quasar and extends for ~ 
lOMpc. The corresponding examples of transmission for the 
mean IGM, centered on the assumed quasar redshifts are 
plotted for comparison (grey curves). In this example the 
quasar has little impact on the redshift at which the Gunn- 
Peterson trough would appear, since the onset is at a large 
distance from the quasar. 

In the left hand panel of Figure [3] we plot the mass 
weighted neutral fraction as a function of redshift. The re- 
sults of our modeling for the mass fraction in the vicinity of 
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Figure 3. Left: Mass weighted neutral fraction as a function of redshift. The evolution for the mean IGM with redshift is shown as the 
grey line. The results of our modeling for the mass fraction in the vicinity of a quasar are shown as the dark curves for quasars at three 
different redshifts. Right: The evolution of Ai and N as a function of redshift. 



a quasar are shown as the dark curves for quasars at three 
different redshifts. For comparison the evolution of the mean 
IGM with redshift is shown as the grey line. On the right 
hand side we show the value of the pre factor N in equa- 
tion @ for Assc, as well as Ai. Our model has N ~ 4 
for most of the redshift range of interest, and shows that 
while it is not constant, N evolves much more slowly than 
Ai. Before proceeding, we note that the quantitative predic- 
tions of our model will be sensitive to the applicability of 
the assumed distribution of over densities Pv(A), which is 
not directly measured in numerical simulations (Bolton & 
Haehnelt 2007b). 



4 21CM OBSERVATIONS OF QUASAR 
PROXIMITY ZONES 

We next estimate the 21cm signal corresponding to the prox- 
imity zones described in the previous section. The 21cm 
brightness temperature contrast corresponding to IGM at 
the mean density is 

T(R) = 22mK I ^— ) (1 - Q ; F M (Ai, R)). (4) 

The resulting 21cm brightness temperature profiles as a 
function of observed frequency are shown in Figure [4] (dark 
lines). For a quasar at z ~ 6.6 the model predicts that the 
expected contrast in front of the quasar is only ~ lmK, while 
at redshifts beyond the quasar the contrast would be as large 
as 5mK. On the other hand, around a quasar at z ~ 6.0 the 
contrast would only by ~ 0.5 — lmK. Also shown for com- 
parison are the corresponding examples for the mean IGM, 
centered on the assumed quasar redshifts (grey curves). 

We estimate the uncertainty for observations using the 
configuration of the Murchison Widefield Array (MWA0), 
which is currently under construction and will comprise a 
phased array of 500 tiles (each tile will contain 16 cross- 
dipoles) distributed over an area with diameter 1.5km. The 
uncertainty was computed assuming 1000 hours of observing 
time. When forming a map from the available visibilities it 
is assumed that resolution has been compromised for lower 

3 see http://www.haystack.mit.edu/ast/arrays/mwa/index.html 



noise in the image by choosing a maximum baseline to be in- 
cluded (Geil & Wyithe 2007). We chose a synthesised beam 
(full beam width at half maximum) Of 6>bcam = 3.2' which 
would be appropriate for quasar proximity zones. The ther- 
mal noise corresponding to this angular scale for the MWA 
is (Geil & Wyithe 2007) 

/-, , \ 2.6 / « \ -0.5 / , \ -0.5 

-="» K (w) (=) (iss) • 

(5) 

where Av is the width of the frequency bin and tint is the in- 
tegration time. At z ^ 6.4, the error bars shown correspond 
to an observation of a single quasar, while at z = 6.2 and 
z — 6.0 the errors refer to the average signal from stacks of 

3 and 10 quasars respectively. These numbers are motivated 
by the expected number counts in planned surveys and will 
be discussed in § [7] The sizes of the error bars correspond 
to binning over the interval in between the points shown (so 
that the errors would be independent). The proximity zones 
would be detectable with good significance in the scenario 
described. 

We have computed line-of-sight 21cm spectra, while ob- 
servations will be made at finite resolution. In the case of 
spherical proximity zones, finite resolution will introduce 
smoothing across the boundaries in observed 21cm spectra. 
For this reason we have restricted our analysis to spectra 
measured within a single synthesised beam centered on the 
quasar line-of-sight. If the proximity zone were spherical, the 
transverse size of the synthesised beam (#bcam = 3.2') at 
the edge of the proximity zone would subtend ~ 20 degrees, 
making the smoothing negligible. Of course the proximity 
zone is unlikely to be spherical. Suppose that the quasar 
were beamed with an opening angle a. If the transverse ex- 
tent of the synthesised beam were wider than the emission 
region at the edge of the proximity zone, we would again 
expect smoothing of the boundary in the observed 21cm 
spectrum. However, given #bcam = 3.2', the above argument 
implies that the spectrum should not be subject to smooth- 
ing so long as a > 20 degrees (unless the quasar beaming is 
misaligned with the line of sight by ~ a). 

4 This corresponds to 5.5' central peak to first null, which is often 
quoted as the resolution. 
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Figure 4. The 21cm brightness temperature as a function of observed frequency. The four panels show examples corresponding to 4 
different quasar redshifts. Also shown are the corresponding examples for the mean IGM, centered on the assumed quasar redshifts 
(grey curves). At z ^ 6.4, the error bars shown correspond to an observation of 1000 hours with the MWA using a maximum baseline 
corresponding to a 3.2' beam. At z = 6.2 and z = 6.0 the errors assume an average signal from stacks of 3 and 10 quasars respectively. 



Furthermore, in the examples shown at z ^ 6.2 we 
have assumed that the spectra from several quasars with 
the same luminosity and redshift could be stacked. In prac- 
tice this process would be subject to several complications. 
First, the high redshift edge of the QSO proximity zone may 
lie in the pre-overlap era. In this case we would expect sig- 
nificant variation between quasars owing to the patchiness 
of reionization. In addition, the separation in redshift of the 
quasars available might exceed the depth of the proximity 
zones. Combining the spectra of different quasars in order 
to increase signal-to-noise would therefore be non-trivial. 

Following the completion of overlap in a region of IGM 
the neutral gas is in a collection of dense pockets rather than 
being diffusely distributed in the IGM. There is therefore an 
additional component of uncertainty for the spectra shown 
in Figure [4] owing to the finite number of emission sources 
that contribute to the 21cm signal. Given a beam radius 
#t>cam and mass-averaged neutral fraction Fm, there are 




emission sources within a frequency bin of width Vbin assum- 
ing the neutral clumps to have a baryonic mass M c . Thus, we 
estimate that if the baryonic mass is limited to be the Jeans 
mass in an ionized IGM (M c ~ 1O 9 M0) then the component 
of uncertainty in a Vbin ~ 5MHz bin is around 5%, while the 



existence of lower mass neutral clumps survived from the 
pre-reionization IGM would lead to an even smaller Pois- 
son contribution. Of course a full calculation of N c \ would 
require a numerical simulation to resolve the details of the 
damped Lya and Ly-limit systems. However equation @ 
suggests that the finite distribution of neutral clumps will 
not contribute the dominant source of uncertainty in 21cm 
observations of quasar proximity zones with the MWA. 



5 THE EFFECT OF FOREGROUND 
SUBTRACTION 

The detection of a proximity zone will require the subtrac- 
tion of a large foreground contribution to the redshifted 
21cm signal. This foreground is thought to be dominated 
by synchrotron radiation, both galactic and extra galactic, 
and to be spectrally smooth (Oh & Mack 2003; Di Mat- 
teo et al. 2002). The processed band-pass of the MWA will 
be Av = 32MHz, which corresponds to a physical length of 
~ 50Mpc at 2 ~ 6. Thus the band pass that will be available 
to the MWA is similar in length to the profiles presented in 
Figure 13] The process of foreground subtraction will render 
any line-of-sight fluctuations with wavelengths comparable 
to, or larger than the band-pass undetectable. Thus, we ex- 
pect to lose the overall trend of the emission with redshift 
across the band pass. 

We take a simple approach to estimate the impact of 
foreground removal, fitting and subtracting a fourth order 
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Figure 5. The 21cm brightness temperature as a function of observed frequency following subtraction of the best fit 4th order polynomial. 
The four panels show examples corresponding to the 4 different quasar redshifts described in Figure [4] Also shown are the corresponding 
examples for the mean IGM, centered on the assumed quasar redshifts (grey curves). At z ^ 6.2, the error bars shown correspond to an 
observation of 1000 hours with the MWA using a maximum baseline corresponding to a 3.2' beam. At z = 6.2 and z = 6.0 the errors 
assume an average signal from stacks of 3 and 10 quasars respectively. 



polynomial from the model spectra in Figure [4] The result- 
ing profiles are shown in Figure [5] (dark lines). We see that 
the subtraction removes the low order fluctuations, including 
the overall rise in intensity across the band-pass. However 
the subtraction leaves fluctuations around the slow rise due 
to the proximity zone. These fluctuations will be detectable, 
and their amplitude will yield the mass-averaged neutral 
fraction of hydrogen in the IGM at redshifts near that of 
the quasar. The asymmetry of the HII region, which is clear 
in Figure [4] would be difficult to detect in the foreground 
removed spectra shown in Figure [5] We will quantify this 
statement below. 

In Figure [5] we also show the corresponding 21cm spec- 
tra for the mean IGM centered on the assumed quasar red- 
shifts, with a fourth order polynomial fit subtracted as be- 
fore. The resulting profiles are shown as the grey curves. 
These profiles have no detectable fluctuations. This is be- 
cause the overlap epoch will take place over a range of 
redshifts that is larger than the frequency bandpass of the 
MWA. We therefore find that the global step will be unde- 
tectable by the MWA (or any instrument with a comparable 
bandpass) . Note in this context that the modeling presented 
yields a global step that is as rapid as allowed in a standard 
cosmological scenario. 

We next quantify the significance with which the quasar 
proximity zones could be detected in redshifted 21cm spec- 
tra. Our discussion is limited to determination of the confi- 



dence with which the spectral feature at the redshift of the 
quasar (and its asymmetry, see Figure [4} could be detected. 
We define detection as the statistically significant rejection 
of a null-hypothesis comprised of the best fit 4th order poly- 
nomial to the spectra shown in Figure [4] Our approach is 
to compute a set of Monte-Carlo realisations of the 21cm 
spectrum by adding Gaussian noise to the model spectra 
shown in Figure [4] The Gaussian noise is assumed to have 
a distribution of variance equal to the noise shown in Fig- 
ures |H5l Using this set of noisy model spectra we construct 
the cumulative distribution of the confidence with which the 
HII region can be distinguished from a smooth (4th order 
polynomial) evolution in the 21cm signal. Specifically, we 
construct the cumulative distribution of the confidence C, 
defined as the probability that a value of \ 2 larger than ob- 
served would arise by chance given the null-hypothesis con- 
sisting of the best fit 4th order polynomial. The resulting 
distributions are plotted in the left hand panel of Figure |SJ 
at four redshifts corresponding to the examples shown in 
Figures 1415 1 As before we assume a noise level correspond- 
ing to a single quasar at z ^ 6.4, and stacks of 3 and 10 
quasars at z — 6.2 and z — 6 respectively. Note that we 
have fitted spectra comprised of 10 points with a 5 param- 
eter polynomial fit, leaving only 5 degrees of freedom. As a 
result any residuals left following subtraction of the fit lead 
to a high confidence for rejection of the null hypothesis. In 
these examples, the spectral dip could be detected with 90% 
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Figure 6. Left panel: Monte-Carlo realisations of the confidence with which the proximity zone can be distinguished from a smooth 
(modeled as a 4th order polynomial) evolution in the 21cm signal. Specifically, the quantity C is the probability that a value of \ 2 
larger than that observed would arise by chance given a null-hypothesis consisting of the best fit 4th order polynomial. Right panel: The 
corresponding distributions for spectra of the mean IGM. In each case the plot shows the cumulative distribution of C, at 4 redshifts 
corresponding to the examples shown in Figures 14151 Also shown for comparison is the cumulative probability distribution Pu n (< C) = C 
(thick grey lines). 



confidence in 50% of cases for 6.0 < z < 6.4. At z ~ 6.6 the 
spectral feature would be detected with 99% confidence in 
more than 50% of cases. 

In the right hand panel of Figure [6] we show the corre- 
sponding cumulative probability distributions for the con- 
fidence C of detecting a departure of the mean IGM spec- 
trum from the 4th order polynomial best fit. In each case we 
show cumulative distributions for C computed from mean 
IGM spectra centered at four redshifts corresponding to the 
quasars in Figures T4I5I The probability of the observed spec- 
trum being inconsistent (in a \ 2 sense) with the best fit 4th 
order polynomial is significantly less for the mean IGM spec- 
trum than for the quasar spectra, indicating that the mean 
IGM signal is better modeled by a 4th order polynomial 
than a quasar near-zone. 

For comparison, we also plot the cumulative distribu- 
tion Pn n (< C) — C in both panels of Figure |S] (thick 
grey lines). In cases where the 4th order polynomial per- 
fectly models the spectrum, we would expect no deviation 
of P(< C) from Pn n (< C). Figure HJ] therefore illustrates 
that the probability of a large value of \ 2 is well m excess of 
random for spectra of quasar near zones at all redshifts con- 
sidered. On the other hand, for spectra of the mean IGM, 
we find that the three higher redshifts considered (z ^ 6.2) 
have distributions P(< C) that are nearly indistinguishable 
from Pn n , as might be expected from the very smooth mean 
IGM spectra shown in Figure |2J At z = 6.0 the 4th order 
polynomial does not adequately describe the spectrum at 
the end of overlap. As a result, P{< C) > Pn n (< C). None- 
theless, as mentioned above, P(< C) is larger for the quasar 
spectrum than for the mean IGM spectrum for each of the 
four cases considered. 

As noted earlier, the progression of overlap during the 
light travel time across the proximity zone leads to the asym- 
metric 21cm spectra shown in Figure [4] However following 
the subtraction of a 4th-order polynomial, the asymmetry is 
less pronounced. To quantify whether the asymmetry could 
be detected in the foreground removed spectra, we subtract 
the points in the foreground removed spectra at negative 



R from points at positive R. This produces curves of resid- 
ual asymmetry AT2i(R) = T2i(R) — T 2 i(—R) consisting of 
5 points which are shown in Figure [7] Inspection of these 
residuals indicates that foreground removal will prohibit the 
detection of asymmetry, which would show up in Figure [7J 
as values of AT21 that differ from zero. To quantify this 
statement, we compute the value of % 2 relative to the null- 
hypothesis of a symmetric model (which would equal at 
each R). We compute this \ 2 an d the associated confidence 
C for each of the Monte-Carlo model spectra assuming a \ 2 
distribution with 5-2=3 degrees of freedom (corresponding 
to 5 points with comparison to a straight line). We then 
construct the cumulative probability distribution P(< C) 
as before. The resulting distributions are plotted in Fig- 
ure |SJ along with the cumulative probability distribution 
Piin(< R) = C (thick grey lines). The left panel shows re- 
sults for spectra of quasar proximity zones, while the right 
panel shows results for spectra of the mean IGM. The dis- 
tributions indicate that asymmetry in the 21cm spectra of 
quasar near-zones could not be detected with high confi- 
dence under the observational conditions assumed in this 
paper. 



6 LUMINOSITY FUNCTION AND NUMBER 
COUNTS OF HIGH REDSHIFT QUASARS 

The advent of large multi-wavelength optical surveys in re- 
cent years has allowed the detailed study of the quasar lumi- 
nosity function to be extended from redshifts corresponding 
to the peak of quasar activity (z ~ 3), out to the end of 
the reionization era at z > 6. Firstly, the density of quasars 
at a redshift of z ~ 6 has been measured using the 8000 
square degrees of imaging from the Sloan Digital Sky Sur- 
vey (SDSS), yielding 21 quasars with z > 5.8 and brighter 
than a z-band apparent AB-magnitude of m z = 20 (Fan et 
al. 2001; Fan et al. 2004; Fan et al. 2006). In addition, fainter 
z ~ 6 quasars have been discovered in a deeper survey of the 
SDSS equatorial stripe (yielding 5 quasars over ~ 125 square 
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Figure 7. The difference (AT21) between the 21cm brightness temperature in front and behind the quasar following subtraction of the 
best fit 4th order polynomial. AT21 is plotted as a function of frequency difference relative to the quasar. A value of AT21 = indicates 
a foreground removed spectrum that is symmetric about the quasar redshift. The four panels show examples corresponding to the 4 
different quasar redshifts described in Figure [4] Also shown are the corresponding examples for the mean IGM, centered on the assumed 
quasar redshifts (grey curves). At z 6.2, the error bars shown correspond to an observation of 1000 hours with the MWA using a 
maximum baseline corresponding to a 3.2' beam. At z = 6.2 and z = 6.0 the errors assume the average signal from stacks of 3 and 10 
quasars respectively. 



degrees brighter than ra z = 21; Jaing et al. 2007). The com- 
bination of deep and wide surveys has allowed the slope of 
the quasar luminosity function to be measured with high ac- 
curacy, and yields a space density of quasars at 2 ~ 6 which 
may be parameterised using the form 

e 6 (M 1450 ) = e^io-°- 4(/3+1)(Ml450+26) , (7) 

where 9g = (5.2T1.9) x 10 _9 Mpc" 3 mag _1 , and /3 = -3.1± 
0.4 (Jiang et al. 2007). The corresponding integral version 
of the luminosity function is 

e 6 (Mi450)dMl450 

-00 

= $jj:xo _0 - 4( ^ +1) ( Ml * 6 ° +26 ) (8) 

where 

= §6 (9) 

6 " -0.41n(10)(/3 + l)' { > 

Comparison of the space density of luminous quasars at 2 ~ 
6 (Fan et al. 2004) with the density measured at 2 ~ 4.3 
(Fan et al. 2001) shows an exponential decline in quasar 
number density with with redshift 

^(Mmbo < -26.7, 2) oc 10 Sxz , (10) 

where B = -0.49 ± 0.07 (Wyithe & Padmanabhan 2005). 



The slope of the luminosity function (/3) changes be- 
tween 2 ~ 4.3 and 2 ~ 6, becoming steeper towards high 
redshift. However we will assume that /3 is constant at 
2 > 6, noting that since we are interested in extrapolation 
to quasars of lower luminosity than those already known, 
this will lead to conservative number counts. With the as- 
sumption of constant /3, and following equation (|10p we next 
write 

e*(z) = e; x io s(z - 6) , (ii) 

yielding the differential and cumulative luminosity functions 

e(A/ 1450 ,2) = 9^ X 10 B(-6) 10 -0.4( /3+ l)(M 1450+ 26) ] ^ 

and 

*(M 146 0,Z) = *6 X W B(z~S) 10 -°-^HM 14 , 0+ 2 6) _ (13) 

These estimates provide a strong empirical basis with which 
to predict the number counts of quasars at moderately larger 
redshifts, but with luminosities comparable to those cur- 
rently observed. 



7 ESTIMATED NUMBER COUNTS FOR 
FUTURE SURVEYS 

Our goal in this section is to estimate the number of quasar 
proximity zones that might be available for study with low 
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Figure 8. Left panel: Monte-Carlo realisations of the confidence with which asymmetry can be detected in the best fit foreground 
subtracted spectrum of a proximity zone. Specifically, the quantity C is the probability that a value of \ 2 larger than that observed 
would arise by chance given a null-hypothesis consisting of a symmetric proximity zone. Right panel: The corresponding distributions 
for spectra of the mean IGM. In each case the plot shows the cumulative distribution of C, at 4 redshifts corresponding to the examples 
shown in Figures 14151 Also shown for comparison is the cumulative probability distribution Pn n (< C) = C (thick grey lines). 




Figure 9. The number of quasars brighter than m z within a 32MHz band whose high frequency end observes the 21cm line at redshift 
z. We plot estimates of the 1-sigma range for a limit m z < 21. The left-hand and right-hand panels show the numbers for an MWA field, 
and a 1200 square degree area. 



frequency arrays. As a specific example we consider the 
MWA, and multiply the density of quasars by the volume 
within an MWA observation in order to estimate the poten- 
tial number counts. Using relations from Furlanetto, Oh & 
Briggs (2006), we obtain the co-moving volume of a cylin- 
der of angular radius 6 and depth Av around a frequency v 
corresponding to the 21cm line at redshift z 

° > (i4H i T £ R^) M -* 

(14) 

Combining this volume with the quasar luminosity function 
we obtain the number counts of quasars per frequency in- 
terval Av at a redshift z brighter than M1450 

dN(M 1450 ,z) Au = V{6A m<M g) 
av 

_ -9g X 10 10 Q.4(/3+l)(M 145 o+26) 

ln(10)(l + /3) 

x ( i T i )"(5^) 10 '"" ,,15) 



Here we have noted that the field of view for the MWA is 

2 A 2 41253 

™ ~ ~~t~. — x — ; square degrees, (16) 

(4m) 2 47T 

with v\ — c (yielding 8 = 12[(l + z)/7] degrees). Integrating 
over the redshift interval corresponding to the MWA band- 
pass of Av — 32MHz, we find 

N(Mu5a,z)= Av, (17) 

Jv-&v A " dv 

where the redshift z = 7(^/204. 1MHz) — 1 corresponds to 
the 21cm line redshifted to the high frequency end of the 
bandpass. We next convert these number counts for an ab- 
solute AB-magnitude M1450 to observed number counts for 
an apparent magnitude limit using the median spectrum 
from the LBQS (Francis et al. 1991), as well as the SDSS 
transmission curves for the i and z-filters. We assume that 
flux is fully absorbed in the Lya forest below an observed 
wavelength 1216(1 + z)A, which is appropriate for 2 > 5.8 
quasars (e.g. White, Becker, Fan, Strauss 2003). 

We present number counts for one MWA field, and also 
for a 1200 square degree region. These are shown in Fig- 
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ure[9]as a function of redshift for four different z-band lim- 
iting AB-magnitudes. The counts are presented assuming 
Av — 32MHz bandpass (which extends over a large fraction 
of a redshift unit). In Figure [9] the labeled redshift corre- 
sponds to the frequency of the redshifted 21cm line at the 
high frequency end of the observing band (see equation 1 17p . 
As a concrete example, it is anticipated that the forthcoming 
Sky Mapper survey will reach i ~ 23 in its six-epoch survey 
(Keller et al. 2007). Since z ~ 6 quasars are discovered using 
an i-dropout criteria [i — z > 2.2; Fan et al. 2001) the cor- 
responding limiting magnitude achieved would be m z ~ 21. 
This limit is shown as the solid line in Figure |J] along with 
the grey shaded region which refers to the 1-sigma uncer- 
tainty in the observed luminosity function. The i-dropout 
requirement limits discovery to z > 5.75, which produces 
the plateau at lower redshifts. Our estimates suggest that 
at most one z > 6.5 quasar (1-cr upper limit) will be dis- 
covered in a 1200 square degree region at m z < 21, while 
one z > 6.25 quasar would be found in each MWA field. 
At slightly lower redshifts the number counts will allow for 
stacking of signal from a number of quasars. For example, 
we would expect to find ~ 5 quasars with z > 6 per MWA 
field. 



8 CONCLUSION 

The introduction of low-frequency radio arrays over the com- 
ing decade is expected to revolutionize the study of the 
reionization epoch. Several studies have been published pre- 
viously, arguing that the observation of the contrast in red- 
shifted 21cm emission between a large HII region and the 
surrounding neutral IGM will be the simplest and most eas- 
ily interpreted signature. These studies have focused on the 
detection of an HII region, formed by the ionizing flux from a 
quasar generating an ionized bubble in a significantly neutral 
IGM. However the highest redshift quasars so far discovered 
(at z ~ 6.4) suggest that the IGM along those lines-of-sight 
is substantially ionized. Thus, more distant quasars would 
need to be discovered in order to find an HII region sur- 
rounding a previously known source. However quasars are 
observed to be extremely rare at high redshifts, and as dis- 
cussed in §0 the prospects for their discovery at z > 6.5 in 
a region of around 1000 square degrees are not good. Alter- 
natively the HII region might be found directly from 21cm 
data. While first generation instruments will detect an indi- 
vidual HII region at modest signal to noise, it has been sug- 
gested that a matched filter approach could be used to find 
HII regions in a blind search (Datta, Bharadwaj & Choud- 
hury 2007). On the other hand, quasar HII regions will have 
a very complex geometry during the overlap epoch, even if 
they emit isotropically (Geil & Wyithe 2007), making the 
blind detection via a matched filter approach more difficult. 

In this paper we have investigated the prospects for 
detection of proximity zones in the highly ionized IGM sur- 
rounding quasars during the late stages of the overlap era. 
We have concentrated on quasars at redshifts where they 
are already known to exist in sufficient numbers to make the 
measurements practical. We employ a semi-analytic model 
which reproduces several post overlap properties of the IGM, 
including the ionizing photon mean-free-path, the mass- 
averaged density of neutral gas and the hydrogen ionization 



rate. In agreement with more sophisticated numerical sim- 
ulations (e.g. Lidz et al. 2007; Bolton & Haehnelt 2007a), 
this model predicts that the Gunn-Peterson Trough will ap- 
pear in the spectra of quasars at z ~ 6.4, even though the 
IGM is highly ionized by that time. We show that while 
quasars at z < 6.5 are likely observed in the post overlap 
IGM, they will still provide valuable probes of the reioniza- 
tion era. This usefulness arises firstly because dense pockets 
of neutral gas will continue to provide a 21cm signal even in 
a highly ionized IGM. Secondly, the light-travel time across 
a quasar proximity zone is probably comparable to the dura- 
tion of hydrogen overlap. As a result, while the IGM studied 
in Lya absorption along the line-of-sight to the quasar may 
be highly ionized, the IGM observed "behind" the quasar 
would be in an earlier stage of overlap and so more neutral. 

We have estimated the 21cm signal corresponding to 
quasar proximity zones as a function of distance from the 
quasar. At z ~ 6.4 our model predicts that while the ex- 
pected contrast in front of the quasar is less than ~ lmK, 
at redshifts beyond the quasar the contrast would be as large 
as 5mK. On the other hand, around a quasar at z ~ 6.0 the 
contrast would be only ~ 0.5 — lmK. Assuming observa- 
tions using the configuration of the MWA, with 1000 hours 
of observing time and a maximum baseline corresponding to 
a 3.2' beam we find that these contrasts could be detected 
in observations of individual quasars at z > 6.4, while at 
2 ^ 6.2 detection would require a stack of observations for 
several quasars. 

In practice he detection of a proximity zone will require 
the subtraction of the large foreground component which 
dominates the redshifted 21cm signal. The process of fore- 
ground subtraction will render any line-of-sight fluctuations 
with wavelengths comparable to, or larger than the band- 
pass undetectable. We therefore fit and subtract a fourth 
order polynomial to our model proximity zone spectra. We 
find that the subtraction removes the low order fluctuations 
including the overall rise across the band-pass. However the 
subtraction leaves residual fluctuations due to the proxim- 
ity zone that would be detectable with the MWA, although 
the asymmetry of the proximity zone due to the evolution 
of the IGM will not. In contrast we find that foreground 
subtraction from the 21cm emission spectra corresponding 
to the mean IGM leaves no detectable fluctuations. This is 
because the overlap epoch will take place over a range of 
redshift that is larger than the frequency bandpass of the 
MWA. Foreground subtraction will render the global step 
in 21cm emission undetectable within a single Av — 32MHz 
bandpass. 

In our model we have used an analytic probability dis- 
tribution for the over density which was fit to numerical 
simulations (Miralda-Escude et al. 2000) . This model allows 
reionization to be computed in an inhomogeneous IGM, and 
provides a framework within which to model the progression 
of reionization from the era prior to overlap when the neutral 
gas is found predominantly in the mean IGM to post overlap 
where dense systems (DLAs) dominate the neutral hydrogen 
content of the universe (Prochaska et al. 2005). At redshifts 
lower than considered in this paper (z < 5) observations of 
proximate DLAs (those within 3000 km/s of the observed 
quasar) show a density of neutral hydrogen that is compara- 
ble to the mean IGM (Prochaska et al. 2007) . On the other 
hand, given the measured galaxy bias of DLAs, an excess of 
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DLAs would be expected in the over dense IGM where the 
proximate DLAs are observed. Prochaska et al. (2007) in- 
terpret this result as evidence that the quasar flux creates a 
proximity zone in the DLAs. Our model includes the mean 
enhancement of over density in the IGM surrounding the 
quasar but does not include a galaxy bias for the galaxies 
that are thought to host the DLAs at more intermediate red- 
shifts. Thus, we expect the model to provide a qualitatively 
correct description prior to and during reionization. However 
at the lowest redshifts considered the unknown properties of 
DLAs could modify our model predictions. 

The density of quasars at z ~ 6 is now well constrained 
(Fan et al. 2006). We have employed the latest measure- 
ments of quasar densities at high redshift to estimate the 
number of quasars that will be discovered in optical-near- 
IR surveys, with specific reference to the numbers that may 
be found in MWA fields. Assuming that the MWA fields 
can be aligned with such surveys we estimate the number of 
quasars that will be discovered per MWA field. One partic- 
ular example is the Sky Mapper survey (Keller et al. 2007), 
which will find around one quasar at z > 6.25, and around 
5 quasars at z > 6 per MWA field. Surveys for high redshift 
quasars (e.g. Sky Mapper) will cover a much larger fraction 
of the sky than is planned for redshifted 21cm observations. 
The fact that upcoming surveys will find more that 1 z > 6 
quasar per MWA field is therefore important, since it means 
that redshifted 21cm observations do not need to be made 
in fields where quasars have been previously discovered. 

In summary we find that if 21cm foregrounds can be 
subtracted to a level below the thermal noise the 21cm emis- 
sion, then proximity zones around high redshift quasars will 
provide a probe of the very end of the overlap era. These 
21cm proximity zones will provide a bridge between mea- 
surements of 21cm intensity fluctuations during the peak of 
the reionization era and studies of Lya absorption following 
the completion of reionization, and so facilitate study of the 
entire evolution of the ionization state of the IGM. 
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